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Chapter Three 
Describing Data: Numerical Measures 

GOALS 

When you have completed this chapter, you will be able to: 

ONE 

Calculate the arithmetic mean, median, mode, weighted 
mean, and the geometric mean. 

TWO 

Explain the characteristics, uses, advantages, and 
disadvantages of each measure of location. 
THREE 

Identify the position of the arithmetic mean, median, 
and mode for both a symmetrical and a skewed 
distribution. 



Goals 
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Chapter Three 

Describing Data: Numerical Measures 
FOUR 

Compute and interpret the range, the mean deviation, the 
variance, and the standard deviation of ungrouped data. 

FIVE 

Explain the characteristics, uses, advantages, and 
disadvantages of each measure of dispersion. 

SIX 

Understand Chebyshev's theorem and the Empirical Rule as 
they relate to a set of observations. 



Goals 
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The Arithmetic Mean 

is the most widely used 
measure of location and 
shows the central value of the 
data. 



It is calculated by 
summing the values 
and dividing by the 
number of values. . 



The major characteristics of the mean are: 



Averagi 



Jl values are used, 
t is unique. 

'he sum of the deviations from the mean is 0. 



Characteristics of the Mean 



For ungrouped data, the 

Population Mean is the 

sum of all the population 
values divided by the total 
number of population 
values: 



where 

|n is the population mean 

N is the total number of observations. 

X is a particular value. 

S indicates the operation of adding. 
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Population Mean 



A Parameter is a measurable characteristic of a 3 6 

population. 



The Kiers 
family owns 
four cars. The 
following is the 
current mileage 
on each of the 
four cars. 




56,000 
42,000 

23,000 
73,000 





56,000 + ...+ 73,000 
u = — — = — = 48,500 

N 4 



Example 1 
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For ungrouped data, the sample mean is 
the sum of all the sample values divided 
by the number of sample values: 




where n is the total number of 
values in the sample. 
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A statistic is a measurable characteristic of a sample. 



A sample of 
five 

executives 
received the 
following 
bonus last 
year ($000): 




Example 2 
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Every set of interval-level and ratio-level data has a 
mean. 

All the values are included in computing the mean. 

A set of data has a unique mean. 

The mean is affected by unusually large or small 
data values. 

The arithmetic mean is the only measure of location 
where the sum of the deviations of each value from 

the mean is zero. 



Properties of the Arithmetic Mean 
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Consider the set of values: 3, 8, and 4. 

The meHB is 5. Illustrating the fifth 

property 



E(X-X) = [(3-5)+(8-5)+(4-5)]=0 



Example 3 
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The Weighted Mean of a set of 

numbers X v X 2 , X n , with 
corresponding weights w v w 2 , 
...,w n , is computed from the 
following formula: 



- (w 1 X 1 +w 2 X 2 +...+ w n X n ) 

Aw — 



(w { +w 2 +..w n ) 



Weighted Mean 



During a one hour period on a 
hot Saturday afternoon cabana 
boy Chris served fifty drinks. 
He sold five drinks for $0.50, 
fifteen for $0.75, fifteen for 
$0.90, and fifteen for $1.10. 
Compute the weighted mean of 
the price of the drinks. 



- 5($0.50) + 15($0.75) + 15($0.90) + 15($ 1 . 15) 



X.= 



W 



5 + 15 + 15 + 15 



$44.50 
50 



= $0.89 



Example 4 
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The Median is the 

midpoint of the values after 
they have been ordered from 
the smallest to the largest. 



There are as many 
values above the 
median as below it in 
the data array. 



For an even set of values, the median will be the 
arithmetic average of the two middle numbers and is 
found at the (n+l)/2 ranked observation. 



The Median 
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The ages for a sample of five college students are: 

21, 25, 19, 20, 22. 



Arranging the data 
in ascending order 
gives: 

19, 20, 21, 22, 25. 

Thus the median is 
21. 




The median (continued) 
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are: 76,73,80,75. 




rhe median is found I 
at the (n+l)/2 = I 
(4+l)/2 =2.5 th data I 
point. I 



Arranging the data in 
ascending order gives: 



73, 75, 76, 80 



Thus the median is 75.5, 



Example 5 
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Properties of the Median 



oThere is a unique median for each data set. 

olt is not affected by extremely large or small 
values and is therefore a valuable measure of 
location when such values occur. 

olt can be computed for ratio-level, interval- 
level, and ordinal-level data. 



olt can be computed for an open-ended 
frequency distribution if the median does not 
lie in an open-ended class. 



Properties of the Median 



The Mode is another measure of location and 
represents the value of the observation that appears 
most frequently. 

Example 6: The exam scores for ten students are: 
81, 93, 84, 75, 68, 87, 81, 75, 81, 87. Because the score 
of 81 occurs the most often, it is the mode. 



Data can have more than one mode. If it has two 
modes, it is referred to as bimodal, three modes, 
trimodal, and the like. 



The Mode: Example 6 
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Symmetric distribution: A distribution having the 

same shape on either side of the center 



Skewed distribution: One whose shapes on either 
side of the center differ; a nonsymmetrical distribution. 





i i 




Can be positively or negatively skewed, or bimodal 



The Relative Positions of the Mean, Median, and Mode 
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Zero skewness Mean 




=Median 
=Mode 



The Relative Positions of the Mean, Median, and Mode: 

Symmetric Distribution 
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• Positively skewed Mean and median are to the right of the 
Mean>Median>Mode f 



IV^ode Me 
Median 



The Relative Positions of the Mean, Median, and Mode: 

Right Skewed Distribution 
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Negatively Skewed Mean and Median are to the left of the Mode. 



Mean<Median<Mode 



| Mean Mode 
Median 



The Relative Positions of the Mean, Median, and 
Mode: Left Skewed Distribution 
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The Geometric Mean 

(GM) of a set of n numbers 
is defined as the nth root 
of the product of the n 
numbers. The formula is: 



GM = ^/(Xi)(X2)(X 3 )...(X.) 



The geometric mean is used to 
average percents, indexes, and 
relatives. 



Geometric Mean 
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The interest rate on three bonds were 5, 21, and 4 
percent. 

The arithmetic mean is (5+21+4)/3 =10.0. 
The geometric mean is 



GM = V(5)(21)(4) 



The GM gives a more conservative 
profit figure because it is not 
heavily weighted by the rate of 
21 percent. 



Example 7 
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Growth in Sales 1999-2004 




Another use of the 
geometric mean is to 
determine the percent 
increase in sales, 
production or other 
business or economic 
series from one time 
period to another. 



* (Value at end of period) 
CjM — n\ 1 

(Value at beginning of period) 



Geometric Mean continued 
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The total number of females enrolled in American 
colleges increased from 755,000 in 1992 to 835,000 in 
2000. That is, the geometric mean rate of increase is 
1.27%. 



GM =8 



835,000 
755,000 



-1 = .0127 



Example 8 
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Measures of dispersion include the following: range, 

mean deviation, variance, and standard 
deviation. 

Range = Largest value - Smallest value 





Measures of Dispersion 
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The following represents the current year's Return on 
Equity of the 25 companies in an investor's portfolio. 




-8-1 


3.2 


5.9 


8.1 


12.3 




4.1 


6.3 


9.2 


13.3 


"3- 1 


4.6 


7.9 


9.5 


14.0 


/ -1 .4 


4.8 


7.9 


9.7 


15.0 


( 1.2 


5.7 


8.0 


10.3 


22.1 




Lowest value: -8.1 



Range = Highest value - lowest value 

= 22.1-(-8.1) 
= 30.2 



Example 9 



Mean 
Deviation 

The arithmetic 

mean of the 
absolute values 

of the 
deviations from 
the arithmetic 
mean. 



The main features of the 

mean deviation are: 
— ' 

All values are used in the 
calculation. 

It is not unduly influenced by 
large or small values. 

The absolute values are 
difficult to manipulate. 

A 




Mean Deviation 



The weights of a sample of crates containing books 
for the bookstore (in pounds ) are: 

103,97, 101, 106, 103 

Find the mean deviation. 
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MD = 



n 



103-102 



+ ...+ 



103-102 



1+5+1+4+5 



= 2.4 



Example 10 
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the ^^^^^^^^^^^^^^H 
mean ^^^^^^^^^^^^^^^^^^^^H 

the 

from ^^^^^^^^^^^^^^^^^^^^H 

the ni t .in ^^^^^^^^^^^^^^^^^^^^B 

^^^^^H Standard deviation: The square 

^^^^^^^H root of the variance. 



Variance and standard Deviation 



The major characteristics of the 

Population Variance are: 



Not influenced by extreme values. 

oThe units are awkward, the square of the 
original units. 

All values are used in the calculation. 



Population Variance 



3-32 



Population Variance formula: 



a = 



g (X - n) 2 
N 



X is the value of an observation in the population 

m is the arithmetic mean of the population 

N is the number of observations in the population 




^^^JfiiJDiKHiJIJdll^^^^ffiiilDn formula: 



a 



4g 



Variance and standard deviation 



3- 33 




a = 



s (X - iQ 2 

N 



(-8.1-6.62) 2 + (-5.1-6.62) 2 + ... + (22.1-6.62) ; 



a 2 = 



25 




42.227 



o = 6.498 



Example 9 continued 
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Sample standard deviation (s) 




Sample variance and standard deviation 



The hourly wages earned by a sample of five students are: 

$7, $5, $11, $8, $6. 
Find the sample variance and standard deviation. 



X= — = — = 7.40 
n 5 

_ ll(x - xf _ (7 - 7.4) 2 + ... + (6 - 7.4) 2 

n^l ~ 5^1 
21 2 

5-1 

|,y = V^ 7 = ^5.30 = 2.3o| 

Example 11 



3- 36 



Chebyshev's theorem: For any set of 

observations, the minimum proportion of the values 
that lie within k standard deviations of the mean is at 

least: 



where k is any constant greater than 1. 



Chebyshev's theorem 



3-37 



Empirical Rule: For any symmetrical, bell-shaped 

distribution: 



o About 68% of the observations will lie within Is 
the mean 



o About 95% of the observations will lie within 2s of 
the mean 



o Virtually all the observations will be within 3s of 
the mean 



Interpretation and Uses of the 
Standard Deviation 
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Bell - Shaped Curve showing the relationship between crand //. I 




|Li-3a (j,-2a |i-la |i |i+la |i+2a |i+3a 



Interpretation and Uses of the Standard Deviation 
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The Mean of a sample of data 

organized in a frequency 
distribution is computed by the 
following formula: 




The Mean of Grouped Data 



A sample of ten 
movie theaters 
in a large 
metropolitan 
area tallied the 
total number of 
movies showing 
last week. 
Compute the 
mean number of 
movies 
showing. 
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IVIovips 
showing 


fVpoiipncv 

f 


class 
midpoint 

X 


(f)(X) 

\J /V 1 -/ 


1 up to 3 


1 


2 


2 


3 up to 5 


2 


4 


8 


5 up to 7 


3 


6 


18 


7 up to 9 


1 


8 


8 


n 




10 


10 


Total 


10 




66 



Al 10 

Example 12 
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The Median of a sample of data organized in a 
frequency distribution is computed by: 



Median — L-\ 

/ 



where L is the lower limit of the median class, CF is the 
cumulative frequency preceding the median class, /is 
the frequency of the median class, and / is the median 
class interval. 



The Median of Grouped Data 



To determine the median class for grouped 

data 

Construct a cumulative frequency distribution. 

Divide the total number of data values by 2. 

Determine which class will contain this value. For 
example, if n=50, 50/2 = 25, then determine which 
class will contain the 25 th value. 
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Finding the Median Class 
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Movies 

snowing 


Frequency 


Cumulative 

Frequency 


1 up to 3 


1 


i 


3 up to 5 


2 


3 


5 up to 7 


3 


6 


7 up to 9 


1 


7 


9 up to 1 1 


3 


10 





Example 12 continued 
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[From the table, L=5, n=10,f=3, i=2, CF=3 



— -CF 10-3 



Median = L + ^ (i) = 5 + -2 (2) = 6.33 

/ 3 



Example 12 continued 
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The Mode for grouped data is 
approximated by the midpoint of the 
class with the largest class frequency. 



r 

The modes in example 12 are 6 and 10 
and so is bimodal. 



The Mode of Grouped Data 



